Using SAS to Analyze the Summary Data
نویسنده
چکیده
Most of the time, we have the raw data to conduct the necessary statistical analysis in SAS®. However, when there is only summary data available, some additional SAS coding is necessary in order to perform the hypothesis test. SAS procedures, as well as simplified macros (%SUM_TTEST, %SUM_ANOVA, %P_ANOVA, %SUM_CHI) with examples, will be discussed in this paper for chi-square test, two sample t-test, and analysis of variance with summary data as input. INTRODUCTION Before initiating a new study, there is often extensive literature review to retrieve background information, compare existent findings, and support the significance of the study. Since most of the papers only present the summary statistics (sample sizes, means, standard deviations, percentages, etc.), it is very helpful if we have the right tools to generate other statistics in a quick and convenient way. For example, if means and standard deviations are known for two independent groups, it is always interesting to know whether there is statistical significant difference in the mean between the two groups. Similarly, if the number of events and sample size are known for two groups, the relative risk, odds ratio, or risk difference between those groups, could also be calculated. This paper discusses the concept of statistical analysis using the summary data and introduces a series of simplified macros for such analysis. ANALYSIS OF SUMMARY CONTINUOUS DATA The continuous variable is usually reported as sample size ( n ), mean (Y ) and standard deviation ( s ). If there are two or more groups of data, additional hypothesis tests of the existence of differences among groups are sometimes very helpful in understanding the background of the research field. The t and F statistics as well as their degree of freedom can be calculated from the sufficient summary statistics Y n, and s (Table 1 and 2); therefore always as to construct the hypothesis tests without the original individual level data. Table 1. Test Statistic for Independent Two Samples T-TEST Assumption t Statistic DF Equal Variance , 1 1
منابع مشابه
Creating Clinical Trial Summary Tables Containing P-Values: A Practical Approach Using Standard SAS Macros
P-value is a key criterion for evaluating the effectiveness and safety of new drugs in clinical trials, particularly in comparative studies. However, p-values are generally not presented in data summary tables generated with SAS software, because of the complexity of incorporating p-values into a formatted table that contains summary statistics, such as mean, proportion, or standard deviation. ...
متن کاملSimplifying the Analysis of Complex Survey Data Using the SAS
Large sample-based surveys often have complex sample designs, with design features including stratification, clustering, multi-stage sampling, and unequal probability of selection of observations. The calculation of the associated sampling weights often involves nonresponse adjustments and raking to external control totals. The analysis usually includes descriptive statistics such as frequencie...
متن کاملA SAS Macro to Analyze Data From a Matched or Finely Stratified Case-Control Design
A matched case-control design is a common approach used to assess diseaseexposure relationships, and is often a more efficient method than an unmatched design. However, for the valid analysis of such an approach, a modeling technique that incorporates the matched nature of the data is needed. This prohibits the use of a standard unconditional logistic regression analysis generally available in ...
متن کاملA Macro for Calculating Summary Statistics on Left Censored Environmental Data using the Kaplan-Meier Method
Calculating summary statistics such as the mean, standard deviation, and an upper confidence limit on the mean is straightforward when the data values are known. However, environmental data often are reported from the analytical laboratory as left censored, meaning the actual concentration for a given contaminant was not detected above the method detection limit. Therefore, the true concentrati...
متن کاملUsing SAS Software to Analyze Sybase Performance on the Web
This paper provides a web-based system using SAS, HTML and CGI/PERL to provide rudimentary and complex Sybase DBMS performance metrics for Unix based system operations. Sybase SQL Server performance data is collected by Sybase Historical Server allowing for the collection of performance information with minimal impact on the server. The SAS System (Base SAS, Macro, STAT and SAS/Graph) is especi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006